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Abstract. Modal formulae express monadic second-order properties on Kripke frames, 
but in many important cases these have first-order equivalents. Computing such equiva- 
lents is important for both logical and computational reasons. On the other hand, canon- 
icity of modal formulae is important, too, because it implies frame-completeness of logics 
axiomatized with canonical formulae. 

Computing a first-order equivalent of a modal formula amounts to elimination of second- 
order quantifiers. Two algorithms have been developed for second-order quantifier elim- 
ination: SCAN, based on constraint resolution, and DLS, based on a logical equivalence 
established by Ackermann. 

In this paper we introduce a new algorithm, SQEMA, for computing first-order equiva- 
lents (using a modal version of Ackermann's lemma) and, moreover, for proving canonicity 
of modal formulae. Unlike SCAN and DLS, it works directly on modal formulae, thus 
avoiding Skolemization and the subsequent problem of unskolemization. We present the 
core algorithm and illustrate it with some examples. We then prove its correctness and 
the canonicity of all formulae on which the algorithm succeeds. We show that it succeeds 
not only on all Sahlqvist formulae, but also on the larger class of inductive formulae, in- 
troduced in our earlier papers. Thus, we develop a purely algorithmic approach to proving 
canonical completeness in modal logic and, in particular, establish one of the most general 
completeness results in modal logic so far. 
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Introduction 

The correspondence between modal logic and first-order logic has been studied exten- 
sively, see for example |29| . Every modal formula defines a (local or global) second-order 
condition on frames, but it is well known that many of these can in fact be equivalently re- 
duced to first-order conditions. Modal formulae for which this is possible are called elemen- 
tary. As Chagrova has shown in [S], the class of elementary modal formulae is undecidable. 
Hence, any attempt at an effective characterization of this class can be an approximation 
at best. 

The best-known syntactic approximation is the class of Sahlqvist formulae |2T] . the first- 
order correspondents of which can be computed using the Sahlqvist-van Benthem method 
of substitutions, see e.g. and 

The Sahlqvist formulae are properly included in the (still syntactically defined) inductive 
formulae, introduced in jl5j and ^7]; for a survey on these, see also [3]. The complex 
formulae (\2<j\) represent yet another extension of the Sahlqvist-class. The formulae in these 
three classes have the important property of being canonical, and hence of axiomatizing 
complete logics. 

Computing first-order equivalents of modal formulae amounts to elimination of the 
second-order quantifiers in their semantic translation. Thus, algorithmic classes of ele- 
mentary modal formulae can be generated by algorithms for elimination of second-order 
predicate quantifiers, such as SCAN (see ^21EIli20j) and DLS (first introduced in 9 as 
an extension of an algorithm presented in |23] , see also [01 12()| \'2^ and ^Hl ) • Each of them, 
applied to the standard translation into second-order logic of a negated modal formula, 
attempts to eliminate the existentially quantified predicate variables in it and thus to com- 
pute a first-order correspondent. Both SCAN and DLS have been applied to compute the 
first-order equivalents of standard translations of modal formulae, and can be regarded as 
the first purely algorithmic approaches to the first-order correspondence theory of modal 
formulae. ^] and [2^ were soon followed by Simmons' algorithm presented in |23j as an 
extension of Sahlqvist-van Benthem method of substitutions (for the Simmons algorithm 
see also jH]). Simmons' algorithm is also applicable to some non-elementary modal formu- 
lae (e.g. modal reduction schemes such as McKinsey's formula), but the output formulae 
produced by it generally involve Skolem functions. 

Briefly, SCAN accomplishes its task on an existential second-order input formula by 
removing the existential second-order quantifiers, skolemizing away the existential first- 
order quantifiers, and transforming the result into clausal form. It then attempts to obtain 
an equivalent formula in which the predicate variables do not occur anymore, by running a 
constraint resolution procedure to generate sufficiently many first-order consequences and 
by applying deletion rules. The introduced Skolem functions are then eliminated from 
the result, if possible. SCAN fails to find a first-order equivalent either if the constraint 
resolution phase does not terminate, or if it cannot unskolemize. It has been proved in ^H] 
that SCAN can successfully compute the first-order equivalents of all Sahlqvist formulae. 

The algorithm DLS is based on the following lemma by Ackermann, see 

Lemma 0.1 (Ackermann's Lemma). Let P be a predicate variable and A{x,'z) and B{P) 
be first-order formulae such that there are no occurrences of P in A{x,'z). If P occurs only 
negatively in B{P) then 

3P(Vx[A(x,z) ^ P{x)] A B{P)) = B{A{t,z)/P{t)) 
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and, respectively, if P occurs only positively in B{P), then 

3P(Vx[P(x) ^ A{x,z)] A B{P)) = B{Ait,z)/P{t)) 

where 'z are parameters, and each occurrence of P(t) in B on the right hand side of the 
equivalences, for terms t, is substituted by A(t,'z). 

The algorithm DLS performs various syntactic manipulations, including skolemization, 
on its input in order to transform it into a form suitable for the application of Ackermann's 
lemma. If that cannot be accomplished, the algorithm fails. When all predicate variables 
have been eliminated it may be necessary to unskolemize the resulting formula in order to 
obtain a first-order equivalent. If it cannot do that, the algorithm fails. 

Both SCAN and DLS have been implemented and are available online. For in-depth 
discussion of the theory and implementation details of these algorithms see QSlt (for SCAN) 
and HB] (for DLS). 

In this paper we present a new algorithm called SQEMA (Second-Order Quantifier 
Elimination for Modal formulae using Ackermann's lemma) for computing the first-order 
frame correspondents of modal formulae. It is influenced by both SCAN and DLS, but unlike 
them it works directly on modal formulae, rather than on their standard translation, and 
computes their (local) first-order equivalents by reducing them to (locally) equivalent pure 
formulae in a hybrid language (see e.g. [2] or Jl]) with nominals and inverse modalities. 
This is done using a modal version of Ackermann's lemma. This approach originated from 
j27j (see also where Ackermann's lemma was rephrased in the framework of solving 

equations in Boolean and modal algebras. The fact that modal formulae are not translated 
has many advantages, e.g.: 

• it avoids the introduction of Skolem functions, and hence the need for unskolemiza- 
tion, which is in general undecidable. Nominals play the role of Skolem constants, 
but disappear when translating the resulting pure hybrid formulae into first-order 
logic. 

• it enables the use of a stronger version of Ackermann's lemma, where instead of 
polarity (positive or negative), modal formulae are tested for monotonicity (upwards 
or downwards) in propositional variables. The extension of SQEMA based on the 
stronger version of Ackermann's lemma is not presented in this paper, but only 
briefly discussed in the concluding remarks in Section El and developed in more 
detail in the sequel to the present paper [£]. 

• it shows that the (suitably extended) modal language is rich enough to enable the 
computation of the first-order equivalents of many modal formulae. 

• it allows one to terminate the execution of the algorithm at any stage and return a 
(hybrid) modal formula with partly eliminated propositional variables, in an exten- 
sion of the basic modal language with inverse modalities and nominals. 

• it allows for a uniform proof of the correctness and canonicity of the formulae on 
which the algorithm succeeds. 

The rest of the paper is organized as follows: In Section 1 we introduce the languages 
we will use, together with some basic notions, and present a modal version of Ackermann's 
lemma. In Section 2 we introduce the algorithm SQEMA, followed in Section 3 by some 
examples of its execution on various input formulae. The correctness of the algorithm and 
the canonicity of the formulae on which it succeeds are proved in a uniform way in section 4, 
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and its completeness for the classes of Sahlqvist and inductive formulae is shown in section 
5. We conclude in section 6 by discussing some extensions of SQEMA. 

1. Preliminaries 

1.1. Syntax, semantics and standard translations. The basic modal language ML is 
built from _L and a countably infinite set of propositional variables (or atomic propositions) 
PROP = {po,Pi, ■ ■ ■} as usual: 

(f ::= _L I T I p I -k^ {ipX/iplLpAipl 0(p \ □(^. 

An occurrence of a modal operator or a subformula in a formula ip has positive 
polarity (or, is positive) if it is in the scope of an even number of negations; respectively, it 
has negative polarity (or, is negative) if it is in the scope of an odd number of negations. 

The connectives and are defined as usual. Note that we take more logical 
connectives as primitives than necessary, in order to avoid syntactic complications when 
introducing rules for each of them, but whenever suitable, we will consider some of them 
definable in terms of the others, as usual. 

A Kripke frame is a pair ^ = (W, R) where is a non-empty set of possible worlds 
and R C W"^ is an accessibility relation between possible worlds. A Kripke model based on 
a frame 5^ is a pair M. = (5, V), where V : PROP — > 2^ is a valuation which assigns to 
every atomic proposition the set of possible worlds where it is true. 

The truth of a formula if at a possible world u of a Kripke model Ai = {W,R,V), 
denoted hy Ai,u \\- cp, is defined recursively as follows: 

M,u\^ T; 

M,u\\- p iS u e V{p); 

M,u II- ^ip iS M,u 1/ ip; 

M, u\\- ipy Tp iS M, u\\- ip OT M,u Ih Tp; 

M, u\\- (f Aip iS M, u\\- ip and M, u Ih ip; 

Ai, u Ih Oip if ^A,w Ih p for some w &W such that Ruw; 

M, u Ih Up M..,w \V- p for every w such that Ruw. 

A modal formula ip is: 

• valid in a model Ai, denoted I h 95, if it is true at every world of Ai; 

• valid at a world u in a frame ^, denoted ^, u Ih (p, if it is true at u in every model 
on 5^; 

• valid on a frame ^, denoted ^ \\- (p, if it is valid in every model based on ^; 

• valid if it is valid on every frame; 

• globally satisfiable on a frame ^, if there exists a valuation V such that V) Ih p. 
A general frame is a structure ^ = {W, R, W) where {W, R) is a frame, and W is a 

Boolean algebra of subsets of 2^, also closed under the modal operators. The elements of 
W arc called admissible sets in ^. 

Given a general frame ^ = {W, R, W), a model over 5^ is a model over (W, R) with the 
valuation of the variables ranging over W. 

Given a modal formula tp, a general frame ^, and w G W, we say that ip is (locally) 
valid at w in ^, denoted ^, w Ih tp, if i/j is true at w in every model over ^. 
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Following [221 define Lq to be the first-order language with =, a binary predicate R, 
and individual variables VAR = {xo,xi, . . .}. Also, let Li be the extension of Lq with a set 
of unary predicates {Pq, Pi, . . .}, corresponding to the atomic propositions {po,pi, ■ ■ ■}■ 

The formulae of ML are translated into Li by means of the following standard transla- 
tion function, ST(-, •), which takes as arguments an ML-formula together with a variable 
from VAR: 

ST(pi,x) := Pi{x) for every pi £ PROP; ST{^ip,x) := ^ST((/?,x); 
ST((/p V V, x) := ST{ip, x) V ST(V', x); ST{^ A V, x) := ST{ip, x) A ST(V', x); 
STiOif, x) := 3y{Rxy A ST((^, y)), ST{a^, x) := ^y{Rxy ^ ST((^, y)), 
where y is the first variable in VAR not appearing in ST(ip,x). 

Now, for every Kripke model M, w A4 and G ML: 

M,w\^ip iff M^ST{^,x){x -.= ^0), 

and 

IH (/? iff >i ^ VxST(v?,rc). 
Thus, on Kripke models the modal language is a fragment of Li. 
Further, for every frame d, U! £ d and if € ML 

^,wh{p iff ^ \= yPST{{p, x){x := w), 

and 

d\^(p iff N VPVxST((/j,x), 

where P is the tuple of all unary predicate symbols occurring in ST{(p,x). Thus modal 
formulae express universal monadic second-order conditions on frames. 

A modal formula if £ ML is: 

• locally first- order definable, if there is a first-order formula a{x) such that for every 
frame ^ and w G 5 it is the case that ^,10 \\- ip iE ^ \= a{x := w); 

• (globally) first-order definable, if there is a first-order sentence a such that for every 
frame ^ it is the case that ^ \\- ip iS ^ \= a. 

Two modal formulae are: 

• semantically equivalent if they are true at exactly the same states in the same 
models; 

• locally frame equivalent if they are valid at exactly the same states in the same 
frames; 

• frame equivalent if they are valid on the same frames. 

For the execution of the algorithm we enrich the language ML by adding: 

• the inverse modality with semantics A4,u Ih n~^ip iff A4,w Ih ip for every 
w € M such that R~^uw, i.e. Rwu. We extend the standard translation for the 
inverse modality in the obvious way: ST(n~-^(^, x) := ^y{Ryx —>■ ST((^, y)), and 
define as the dual of D^^. 

• nominals (see a special sort of propositional variables Nom = {ii,i2,...} 
with admissible valuations restricted to singletons. The truth definition of nominals 
is: {W,V),u Ih i iff V{i) = {u}. The standard translation: ST(ij,j;) := x = yi, 
where yo,yi, ■ ■ ■ are reserved variables associated with the nominals Iq, ii, . . .. 
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The SO extended modal language will be denoted by ML"*". 

Let be a model and (/p a formula from ML+. We denote {(pIm = ^ : M, m Ih 
i.e., \^\m is the extension of the formula (p in the model Ad. 

A pure formula in ML^ is a formula which contains no propositional variables, but 
which may contain nominals. Every pure formula 7, is locally first-order definable by the 
formula VyST(7,a;), where y is the tuple of all variables yi corresponding to nominals ij 
occurring in 7. 

Let A,B{p) be formulae in ML^. Then B(A/p), hereafter also written as B(A), is the 
formula obtained from B{p) by uniform substitution of A for all occurrences of p. 

Although formally ML"*" is sufficient for the execution of our algorithm, it is sometimes 
useful to further enrich the language with the universal modality [U], with semantics A4,u\\- 
[U](/9 iff M,w Ih (f for every w G M, i.e. iff M. Ih (f, and define (U) as the dual of [U]: 
Ai,u Ih {U)(p iff Ai,w Ih if for some w & M iS A4,u II — ^[U]^(^. The standard translation 
is extended accordingly: ST([U](/?, x) := \/xST((p, x) and ST((U)(/9, x) := 3xST(ip, x). In 
particular, we use the universal modality in the next sub-section to re-phrase the modal 
version of Ackermann's Lemma and compare it with the method of substitutions. 

1.2. Modal version of Ackermann's Lemma. Prom now on we will work in ML"^, unless 
otherwise specified. Let AT = PROP U NOM. 

A formula (p is positive (negative) in a propositional variable p if all occurrences of p 
in if are in the scope of an even (odd) number of negations, if is positive (negative) if it 
is positive (negative) in all propositional variables. Note that nominals are not taken in 
consideration here, and a pure formula is both positive and negative. 

Informally, a formula if is upwards monotone in a propositional variable p if its truth is 
preserved under extensions of the interpretation of p. Similarly, cp is said to be downwards 
monotone in p if its truth is preserved under shrinkings of the interpretation of p. Here are 
the formal definitions. 

Definition 1.1. (Monotonicity of a formula) A formula ip is upwards monotone in a 
propositional variable p if for every frame all states w & ^, and all valuations V and V' 
on ^ such that V{p) C V'{p) and V{q) = V'{q) for all 9 G AT, g 7^ p, if V),w\\-(p then 
{^,V'),w Ih if, or, cquivalently |<^l(5,y) C |<^](5^y'), where [</'](5,y) denotes the extension of 
(p in the Kripke model V). 

Similarly, a formula ip is downwards monotone in a propositional variable p if for every 
frame ^, all states w e ^, and all valuations V and V on ^ such that V'{p) C V{p) 
and V{q) = V'{q) for all q G AT, q ^ p, if \\- ip then ^, V' Ih or, equivalently 
M(^y) Q M(s,V')- 

Note that: 

• the negation of a positive (resp. upwards monotone) formula is a negative (resp. 
downwards monotone) formula. 

• every formula positive in p is upwards monotone in p, and respectively, every formula 
negative in p is downwards monotone in p. 

• the result of substitution of upwards (resp. downwards) monotone formulae for 
the variables in any upwards monotone (in particular, positive) formula is upwards 
(resp. downwards) monotone. In particular, p{p) is upwards (resp. downwards) 
monotone in p iff p{-^p) is downwards (resp. upwards) monotone in p. 
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Lemma 1.2 (Modal Ackermann Lemma). Let A,B{p) be formulae in ML^ such that the 
propositional variable p does not occur in A and B{p) is negative in p. Then for any model 
Ai, A4 \\- B(A) iff A4' \\- {A ^ p) A B(p) for some model M' which may only differ from 
Ai on the valuation of p. 

Proof If M Ih B{A), then M' Ih {A ^ p) A B{p) for a model M' such that \p\m' = IMm- 
Conversely, if M' Ih {A ^ p) A B[p) for some model M' then M' Ih B{A/p) since B{p) is 
downwards monotone. Therefore, ^A\\- B{A/p). □ 

An analogue of this lemma can be formulated for positive formulae i?, too. We note that 
a somewhat different version of Ackermann's lemma has been proved for modal first-order 
formulae in (25|, where it is also applied to some modal formulae. 

1.3. Ackermann's Lemma and the substitution method. Note that the contraposi- 
tive form of the modal Ackermann's lemma (after replacing -^B with B) is equivalent to: 

Vp([U](^^p)^i?(p)) ^ B{A/p), 

for any modal formula A not containing p, and a modal formula B which is positive with 
respect to p. To see that equivalence, note that [U](yl p) is true in a model M. iff 
{A}m c Mm. 

The statement above can be interpreted as follows: [U](74 p) ^ B{p) is valid in a 
given frame iff B{p) is true for the 'minimal' valuation satisfying the antecedent, viz. A. 
This is precisely the technical idea at the heart of the substitution method! 

2. The algorithm SQEMA 

In this section we formally introduce the core algorithm SQEMA. To reduce technical 
overhead in the exposition, the algorithm will be presented in the basic modal language. In 
[3] SQEMA is extended to arbitrary polyadic and hybrid multi-modal languages. 

2.1. The core algorithm. Given a modal formula 99 as input, we transform -k^ into 
negation normal form. We then apply the algorithm's transformation rules (all of which 
preserve local frame-equivalence), to formulae of the form a — > /3, with a and (5 both in 
negation normal form, which we call 'equations'^. All equations are interpreted as global 
statements (i.e., valid in the whole model). The algorithm works with systems of equations. 
Each transformation rule transforms an equation into one or more new equations. The 
Acker mann-rule, based on Ackermann's lemma, is applicable to a whole system of equations. 
The idea is to choose a propositional variable and transform the current system of equations, 
using the transformation rules presented further, into a system to which the Ackermann-rule 
can be applied with respect to the chosen variable, thus eliminating it. This process is then 
repeated until all propositional variables are eliminated. As will be seen (see example 13. 3() . 
the success of the algorithm may in general depend upon the order in which this elimination 
of propositional variables is attempted. The algorithm accordingly makes provision for 
trying different orders of elimination. 

Now for a more formal description. The algorithm takes as input a modal formula 9? 
and proceeds as follows: 

"'^ Because the algorithm resembles a procedure of solving systems of linear algebraic equations by Gaussian 
elimination. 
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Step 1: Negate ip and rewrite -^ip in negation normal form by eliminating the connec- 
tives '— >' and and by driving all negation signs inwards until they appear only 
directly in front of propositional variables. 

Then distribute diamonds and conjunctions over disjunctions as much as possible, 
using the equivalences 0{(pVi/j) = {0(p V Oil>) and {ip\/ if)) f\0 = {(p A 9) V (ij: A 6) , in 
order to obtain \J a^, where no further distribution of diamonds and conjunctions 
over disjunctions is possible in any a^. 

The algorithm now proceeds on each of the disjuncts, separately, as follows: 

Step 2: Rewrite as i ^ Ofc, where i is a fixed, reserved nominal, not occurring in 
any a^, and only used to denote the initial current state. This is the only equation 
in the initial system. 

Step 3: Eliminate every propositional variable in which the system is positive or neg- 
ative, by replacing it with T or ±, respectively. 

Step 4: If there are propositional variables remaining in equations of the system, 
choose one to eliminate, say p, the elimination of which has not been attempted yet. 
Proceed to step 5. (So far this choice is made non-deterministically; some heuristics 
will be suggested in a sequel paper.) If all remaining variables have been attempted 
and Step 5 has failed, backtrack and attempt another order of elimination. 

If all orders of elimination and all remaining variables have been attempted and 
step 5 has failed, report failure. 

If all propositional variables have been eliminated from the system, proceed to 
Step 6. 

Step 5: The goal now is, by applying the transformation rules listed below, to rewrite 
the system of equations so that the Ackermann-rule becomes applicable with respect 
to the chosen variable p in order to eliminate it. Thus, the current goal is to 
transform the system into one in which every equation is either negative in p, or of 
the form a ^ p, with p not occurring in a, i.e. to 'extract' p and 'solve' for it. 

If this fails, backtrack, change the polarity of p by substituting -ip for it every- 
where, and attempt again to prepare for the Ackermann-rule. 

If this fails again, or after the completion of this step, return to Step 4. 

Step 6: If this step is reached by all the branches of the execution it means that all 
propositional variables have been successfully eliminated from all systems result- 
ing from the input formula. What remains now is to return the desired first-order 
equivalent. In each system, take the conjunction of all equations to obtain a for- 
mula pure, and form the formula Vy3a;oST(-ipure, xq), where y is the tuple of all 
occurring variables corresponding to nominals, but with yi (corresponding to the 
designated current state nominal i) left free if the local correspondent is to be 
computed. Then take the conjunction of these translations over the systems on all 
disjunctive branches. For motivation of the correctness of this translation the reader 
is referred to the examples in the following section as well as the correctness proof 
in sectional 

Return the result, which is the (local) first-order condition corresponding to the 
input formula. 

Note that, if a right order of the elimination of the variables is guessed (i.e. chosen 
non-deterministically) rather than all possible orders explored sequentially, this algorithm 
runs in nondeterministic polynomial time. However, we believe that additional rules which 
determine the right order of elimination (if any) can reduce the complexity to PTIME. 
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2.2. Transformation Rules. The transformation rules used by SQEMA are listed below. 
Note that these are rewriting rules, i.e. they replace the equations to which they apply.^ 

I. Rules for the logical connectives: 

A-rule: ij. 



(3^7,13^6 

Left-shift V-rule: 4 Right-shift V-rule: 



(/3 A -7) ^ , 

/3 ^ 7 V(5 



O-I7 ^ S 
Right-shift G-rule: JJ- 

7 ^ □(J 

j ^07 

j ^ Ok, k ^ 7 
where j is any nominal and k is a new nominal. 

Sometimes we will write i?jk as an abbreviation for j Ok. 

II. Ackermann-Rule: This rule is based on the equivalence given in Ackcrmann's 
Lemma. It works, not on a single equation, but by transforming a whole set of equations 
as follows: 

ai -^p, 
Piip), 

Pm{p), 

where: 

(1) p does not occur in cti, . . . , 

(2) each of /3i , . . . , /J^ is negative in p; 

(3) no other equations in the system contain p. 

Hereafter, we will refer to the formulae ai, . . . , a„ in an application of the Ackermann- 
rule as a-formulae, and to Pi, ... , as P- formulae. 

III. Polarity switching rule: Switch the polarity of every occurrence of a chosen 
variable p within the current system, i.e. replace ^p by p and p by -ip for every occurrence 
of p not prefixed by -1. 

IV. Auxiliary Rules: These rules arc intended to provide the algorithm with some 
prepositional reasoning capabilities and to effect the duality between the modal operators. 

^Rajeev Gore has remarked that these transformation rules are reminiscent of the rules in display logics. 



7 ^ 0(5 
Left-shift D-rule: ^ 



/3i[(ai V... Va„)/p], 
Pm[{ai\/ ...y an)/p]. 
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(1) Commutativity and associativity of A and V (tacitly used). 

(2) Replace 7 V -17 with T, and 7 A -17 with _L. 

(3) Replace 7 V T with T, and 7 V _L with 7. 

(4) Replace 7 A T with 7, and 7 A _L with _L. 

(5) Replace 7 — > _L with -17 and 7 ^ T with T. 

(6) Replace _L ^ 7 with T and T ^ 7 with 7. 

(7) Replace -lO-i with □ and ^D-i with O. 

Note that, apart from the polarity switching rule, no transformation rule changes the 
polarity of any occurrence of a propositional variable. 

3. Some examples of the execution of SQEMA 

In this section we illustrate the execution of SQEMA on several formulae and discuss 
some of its features. 

Example 3.1. Consider the formula ODp nOp. 
Step 1: Negating, we obtain ODp A OU-ip. 
Step 2: The initial system of equations: 

||i^ (ODpAOD^p) . 

Step 3: The formula is neither positive nor negative in p, the only occurring propo- 
sitional variable. 
Step 4: We choose p to eliminate — our only option. 

Step 5: We will try to transform the system, using the rules, so that the Ackermann- 
rule becomes applicable. 
Applying the A-rule gives 

i OHp 
i Oa^p ' 

Applying first the O-rule and then the D-rule to the first equation yields: 

mi 

j ^ °p 

i OU^p 



and then 



The Ackermann-rule is now app' 



i -> OD^p 
icable, yielding the system 



i ^ on-(o-ij) 

All propositional variables have been successfully eliminated, so we proceed to 
step 6. 

Step 6: Taking the conjunction of the equations gives 

i2ijA(i^0n-(0-ij)). 

Negating we obtain 

i?ij^(iAnoo-ij), 
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which, translated, becomes 



yyj3xo[Ryiyj {xq = yi) A \/y{Rxoy 3u{Ryu A 3v{Rvu A v 
and simphfies to 



vM 



^vARyiVj ^y{Ryiy 3u{Ryu a Ryju))] 



defining the Church- Rosser property, as expected. Note that the variable yi occurs 
free and corresponds to the nominal i, which we interpret as the current state. Hence 
we obtain a local property. Also, we directly translate the abbreviation as Ryiyj, 
since ST(i ^j,a;o) is = — > 3z{Rxoz A z = yj), which clearly simplifies. 

Example 3.2. Consider the formula p A □(Op — > Oq) — > <ynnq. Note that it is not 
equivalent to a Sahlqvist formula (see '17 ). 

Step 1: Negating we obtain p A □(□-■p V Oq) A aOO^q. 

Step 2: i^lpA □(□^p V Oq) A aOO^q]. 

Step 3: Nothing to do, as the system is neither positive nor negative in p or q. 

Step 4: Choose p to eliminate. 

Step 5: Applying the A-rule twice, we get 



The system is now ready for the application of the Acker mann-rule, as p has been 
successfully isolated, and i — > □(□-■p V Oq)) is negative in p: 



Step 4: Choose q, the only remaining option, to eliminate next. 

Step 5: Successively applying the D-rule, Left-shift V-rule and D-rule again, we get 



Step 6: (i □00[n"i(n"i-.i V □-■i)), when negated becomes i A Onn[0"i(0"ii A 
Oi). Translating into first-order logic we obtain 

3xo[xo = yi A 3zi{RxoZi A yz2{RziZ2 yz^{Rz2Z^ 3ui[Ruiz^ 
A3u2{Ru2Ui Au2 = yi) A 3u3{RuiU3 A Us = yi)])))]- 

Note that, in this example, the order of elimination of the prepositional variables is inessen- 
tial, as eliminating q first and then p works equally well. 

Example 3.3. Consider the formula □(□p ^ q) ^ P- The current implementations 
of both SCAN (http : //www.mpi-inf .mpg.de/~ohlbach/ scan/ corr jEorm.html) and DLS 
(http://www.ida.liu.se/labs/kplab/projects/dls) fail on it. Let's see what SQEMA 
does with it: 



1 — > p 

i □(□^p V Oq)) 
i aOO^q 



i □(□-.! V Oq)) 
i aOO^q 



O-i(O-iiAOi) ^q 
i aOO^q 
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Step 1: a{{0^p V A {^q V Op)) A ^p. 
Step 2: 

II i a{{0^p y q) A {^q V Dp)) A -^p . 

Step 3: All variables occur both positively and negatively, so we go on to step 4. 
Step 4: Choose q to eliminate. 

Step 5: Applying the A-rule and the D-rule we transform the system into 

O-^i ^ {O^pVq) 

O-H^ l^qyap) . 

i — > -ip 

Applying the Left Shift V-rule to the first equation yields 

(O^^i ADp) ^ q 
O^'^i ^ {^q y ap) . 
i — > -ip 

to which the Ackermann-rule is applicable with respect to q. This gives 

O-^i (^O-^i V ^Op V Op) 
i -ip 

The first equation in the system is now a tautology and may be removed (we 
interpret systems of equations conjunctively), yielding the system 

II i ^ , 

in which p may be replaced by _L since it occurs only negatively, resulting in the 
system 

II ^ 

Step 6: Negating we obtain _L. 

Some remarks are in order here. Firstly, note that the success of the algorithm may 
depend essentially upon the ability to do some propositional reasoning. Particularly, it had 
to recognize C>~^i — > V -iDp V Op) as a tautology. If this had been written in a 

slightly different form, however, say as <C>~^i V O-^p V Dp), its tautological status 

may not be recognized so easily. This is where a stronger version of Ackermann-rule (see 
Section El, which involves testing for monotonicity, rather than polarity would prove to be 
useful. For, testing the consequent for (upward) monotonicity would give the answer 'yes' 
— this is, post hoc, easy to see, since we already know that the consequent is a tautology. 
This would allow us, after changing the polarity of p, to apply the stronger, monotonicity 
based, version of the Ackermann-rule to the system 

O-M (^O-M V Op V a^p) 
i ^ p ' 

and obtain 

II O^^i^ (D-i^iVOivn^i) , 

which simplifies to 

II O^M^ (□"^^iVOiV^Oi) , 

which, as before, is a tautology, yielding, in step 6, the first order equivalent ±. 

Secondly, suppose that we had tried to eliminate p first. Note that we will gain nothing 
by changing the polarity of p for we cannot get the occurrence of p 'out' under the diamond 
in 0~^i — > {Op V q). Indeed, the system may be transformed to become 
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O-^i {O^pV q) 
0-\0-^i/\q) , 
i — > -ip 

to which we may apply the Ackermann-rule with respect to p and obtain 

O-H (0^(0-l(0-li A q)) V q) 
^(O-i(O-iiAg)) 
It is now not difficult to see that SQEMA gets stuck on this system. 
Moral: the order of elimination does matter sometimes, that is why the algorithm 
incorporates the backtrack option. Of course, theoretically this may lead to an exponential 
increase of the number of steps, but in practice this apparently does not happen if suitable 
heuristics or additional rules can be applied to determine the right order of elimination. 

4. Correctness and canonicity of SQEMA 

4.1. Modal formulae as operators on general frames. For what follows, it is use- 
ful to think of modal formulae as set-theoretic operators. If G let PROP(^) = 
{qi,---,qm} be the propositional variables occurring in ip, HOM(ip) = } the 
nominals occurring in ip, and /^{ip) = PROP(V') U NOM(i/;). Recall that with every 
model Ai = {W, R, V) and a modal formula ip in ML we associate the set [V'Ia^ denot- 
ing the extension of the formula ip in the model A4. Clearly, [V'Ia^ only depends on 
the valuation of PROP(^/') in M, and therefore {ipj defines an operator from V{W)'^ to 
V{W). Likewise for formulae of the extended language ML~^, with the amendment that if 
PROP(^) = {qi,...,qni} and NOM(V') = {ii, . . . , i„}, then [V^l : ViW)"^ ^ V{W). 
Whenever appropriate, we will simply identify formulae with the operators they define. For 
instance, \i if = D-^pVOj, x and A<ZW,vje will write □"MvO{x} or □"MuO{x} 
to denote the extension of ip when the valuation of p is A and that of j is {x}. If we have 
to specify the relation, we will write {R) and [R\ for OA and respectively. Also R{A) 
(respectively i?~^(j4)) will be used to denote the set of successors (respectively predeces- 
sors) of points in A. We will be sloppy and write R{x) (respectively R~^{x)) for R{{x}) 
(respectively R~'^{{x])). 

We will now extend {ad hoc) the notion of local validity at a state w of a general frame 
5^ = (ly, i?,W) for the basic language ML, to formulae of the extended language ML"*" as 
follows: For a formula ip € ML"*", it is the case that '^^w h tp \i \\- ip for every model 

A4 gM(3^) extended in a standard way to the inverse modalities, and where all nominals 
can take as values any v G W , except for the special nominal i which is only interpreted in 
the current state w. Note that, in doing so, we lose the property that the extension of any 
formula in a model based on a general frame will itself be an admissible set (i.e. a member 
of W). 

4.2. Topology on Descriptive Frames. Hereafter, unless otherwise specified, we only 
deal with general (and in particular, descriptive) frames in ML. For the basic topological 
concepts used in this section, the reader may consult any standard reference on topology, 
e.g. EOl. 

With every general frame ^ = {W,R,W), we associate a topological space {W,T{^)), 
where W is taken as a base of clopen sets for the topology T(5). Let C(W) denote the set 
of sets closed with respect to T{^). 
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A general frame ^ = {W, R, W) is said to be differentiated if for every x,y G W, x ^ y, 
there exists X G W such that x G X and y ^ X (equivalently, if T{^) is Hausdorff); 
tight if for all x,y e W it is the case that Rxy iff x G f]{{R){Y) : Y e W and y e ¥} 
(equivalently, if R is point-closed, i.e. R({x}) is closed for every x G W); compact if every 
family of admissible sets from W with the finite intersection property (FIP) has non empty 
intersection (equivalently, if r(5') is compact). ^ is called descriptive it is differentiated, 
tight and compact. 

Definition 4.1. A formula tp G ML is locally d-persistent if for every descriptive frame 
^= (VF',i?,W) and w£W, 

IH^ iff Ih V, 

where F = {W, R) . 

Note that we only talk about local d-persistence of formulae from the basic language 
ML. It is well known (see e.g. j2|) that (local) d-persistence implies canonicity of formulae 
in ML because the canonical general frame for every normal modal logic is descriptive, and 
hence every d-persistent axiom is valid in its canonical frame. 

Definition 4.2. A formula 7 = "y{pi, ■ ■ ■ ,Pn) from ML~^ is a closed operator in ML, if 
for every descriptive frame ^ = {W, R, W) , if Pi , . . . , G C (W) and any singletons are 
assigned to the nominals in 7, then 7(^1, . . . , Pn) G C(W), i.e. when applied to closed sets 
in a descriptive frame it produces a closed set; 7 is a closed formula in ML if whenever 
applied to admissible sets in any descriptive frame it produces a closed set. Thus, if a 
formula is a closed operator, then it is a closed formula, but not necessarily vice versa. 

Similarly, a formula from ML~^ is an open operator in ML if whenever applied to open 
sets in a descriptive frame it produces an open set; it is an open formula in ML if whenever 
applied to admissible sets it produces an open set. 

Note that the operators O and O^-*^ distribute over arbitrary unions, and □ and 
distribute over arbitrary intersections. Since every closed set can be obtained as the in- 
tersection admissible sets and each open set as the union of admissible sets, and O and □ 
applied to admissible sets yield admissible sets, it follows that Op is an open operator and 
□p is a closed operator, and this holds even for arbitrary general frames. 

The proof of the next lemma, originally from 11 , can also be found in |22] or [2]. 

Lemma 4.3 (Esakia's Lemma for O). Let ^ be a descriptive frame. Then for any downward 
directed family of nonempty closed sets {Ci : i € 1} from T(^), it is the case that O Hie/ C'j = 

nieIoc^■ 

Corollary 4.4. Op is a closed operator in ML. 

Lemma 4.5 (Esakia's Lemma for O""*^ in ML). Let ^ be a descriptive frame. Then 
O^^ flig/ C'j = Cliei for any downwards directed family of nonempty closed sets 

{Ci-.iel} fromTi:S)- 

Proof. The inclusion O^^ flie/ Ci C Hie/ trivial, so suppose thatxo O-ipiG/C'i, 

i.e. {R){xo)rif]-^j Ci = 0. Now {R){xo) is closed by cor oUarv 14 . 41 and the fact that singletons 
are closed in descriptive frames. Hence {{R){xo)}L} {Ci : z G 1} is a family of closed subsets 
with empty intersection which, by compactness, cannot have the FIP. Thus there is a finite 
subfamily {Ci, . . . , C„} C {Q : i G /} such that {R){xo) n Ci n • • • n C„ = 0. Since 
{Ci : i G /} is downwards directed, it follows that there that there exists a C G {Ci : i £ 1} 
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such that C C n{Ci, • • • jC'n} and {R){xq) n C = 0. But then xq ^ O'^C, and hence 
Lemma 4.6. O^'^p is a closed operator in ML. 

Proof. The proof is adapted from ^7j. Let ^ = {W,R,W) be a descriptive frame in ML. 
We have to show that for any closed ^ C it is the case that O^-'^(^) = f^{B € W : 
'0~^A C B}. Note that 0~^(A) = R{A). The inclusion from left to right is trivial. In 
order to prove the right-to-left inclusion, suppose that xq 0~^{A), i.e. for all y G j4 
it is not the case that RyxQ. By the point-closedness of R we have that R{y) = C\{^ ^ 
W : y G OB}. Then for each y ^ A there must exist a B^ € W such that y E OB^ and 
Xq B^ , and hence A C {J^nB^ : y € A}. Therefore {OB^ : y G A} is an open cover 
of the closed set A, so by compactness there exists a finite subcover OBi, . . . , OB^- Then 
A C nBi U • • • U OBn and xq Bi, I < i < n. Since distributes over arbitrary unions, 
we then have O'^A C O'^aBi U • • • U O-'^nB^. And, since for any X CW, O'^aX C X, 
we have O-^A C Bi U ■ ■ ■ U B n- So we have found an admissible set containing O ^A, 
not containing xq, and hence xq fli-^ ^ ^ • ^ B}, proving the inclusion and the 

lemma. □ 

Corollary 4.7. is an open operator in ML. 

Proof. By the duality of and □ 

Definition 4.8. A formula G ML~^ is syntactically closed if all occurrences of nominals 
and in if are positive, and all occurrences of in (p are negative; if is syntactically 
open if all occurrences of and nominals in ip are negative, and all occurrences of in 
99 are positive. Clearly maps syntactically open formulae to syntactically closed formulae, 
and vice versa. 

Lemma 4.9. Every syntactically closed formula from ML^ is a closed formula, and every 
syntactically open formula is an open formula. 

Proof. By structural induction on syntactically open / closed formulae, written in negation 
normal form, using the facts that O and □ are open and closed operators, is a closed 
operator, D"^ and open operator and the fact that singletons are closed in descriptive 
frames. □ 

Lemma 4.10. Let '^{qi, ■ ■ ■ , QmP, ii; ■ • • im) be a syntactically closed formula which is posi- 
tive inp and with PROP{ip) = {qi, . . .,qn,p} and NOM((/?) = {ii, . . .im}- Let^ = {W,R,W) 
be a descriptive frame in ML. Then for all Qi, ■ ■ ■ ,Qn £ W, xi,...,Xm G W, and 
C G C(W), it is the case that (p{Qi, . . . , Qn, C, {xi}, . . . , {2;^}) is closed in T{^). 

Proof. We assume that (p is written in negation normal form, hence we may also assume 
that does not occur, as all occurrences have to be negative, and rewriting in negation 
normal form will change these into ■0~^. We proceed by induction on ip. If 99 is T, _L or 
one oi qi, . . . , qn,p,ii, . . . im it is clear that (p{Qi, . . . , Qn, {xi}, ■ ■ ■ , {xm}, C) is a closed set. 
This is also the case if ip is the negation of a propositional variable from gi, . . . , The 
case when ip is -^p does not occur. 

The cases for A and V follow since the finite unions and intersections of closed sets are 
closed. The cases for O and follow from corollarv 14.41 and 14. 6( respectively. The case 
for □ follows is immediate from the fact that □ is a closed operator, as was noted earlier. □ 
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Lemma 4.11 (Esakia's Lemma for Syntactically Closed Formulae). Let 
ip{qi, . . . , qn,p,ii, ■ ■ ■ im) be a syntactically closed formula with PROP(99) = {qi, . . . ,qn,p} 
and NOM{ip) = {ii, . . . i^} which is positive in p. Let ^ = {W, R,W) be a descriptive frame 
in ML. Then for all Qi, . . . , Qn £ W, xi, . . . , Xm G W and downwards directed family of 
closed sets {Ci : i € L} it is the case that 

(p{Ql,. ..,Qn,f] Ci, {Xi},..., {Xm}) = P) ^p{Ql, ■■■,Qn, Q, {xi}, {Xm})- 

Proof. The proof is by induction on (p. For brevity we will omit the parameters Qi, . . . , Qn, 
xi, . . . ,Xm when writing (sub)formulae. We assume that formulae are written in negation 
normal form, i.e we may also assume that D"^ does not occur, as all occurrences have to 
be negative, and rewriting in negation normal form will change these into 0~^. The cases 
when 99 is _L, T or among qi, . . . , qn,p,ii, ■ ■ ■ im are trivial, as are the cases when is the 
negation of a propositional variable among q\, . . . , q^ • The inductive step in the case when 
(p is of the form 71 A 72 is also trivial. 

Suppose if is of the form 71 V 72. We have to show that 7i(nig/ C'i) U 72(046/ Ci) = 
C\i(zi{li{Ci) U 72(Cj)). The interesting inclusion is from right to left, so assume that xq 
71 (Hie/ C"*) U 72(0^6/ '^*)' i-*^ ^0 ^ niG/7i(Ci) U nig7 72(Ci), by the induction hypothesis. 
Thus there exists Ci,C2 G {Q : i G /} such that xq ^ 7i(C'i) and xq 72(C'2)- By the 
downward directedness of {Ci : i G L} there is a C G {Ci : i G /} such that C C Ci n C2. 
Thus, since 71 and 72 are positive and hence upwards monotone in p, it follows that xq 
71(C) and xo 72(C), and hence that xq ^ Oi^iiniCi) U 72(Ci)). 

Suppose if is of the form O7. We have to show that Oj{f]i^j Ci) = Hie/ ^7(^*1)- By the 
inductive hypothesis we have '^jiCliei ^i) ~ Cliei ^(^i)- 7(^«) ~ ^ some Ci, then 
Cliei l{Ci) = = HiG/ '^liCi), so we may assume that 7(Cj) / for all z G /. Then, by 
Lemma [4.101 {7(Ci) : i G /} is a family of non-empty closed sets. Moreover, {7(Cj) : i G /} 
is downwards directed. For, consider any finite number of members of {7(Ci) : i G /}, 
7(Ci), . . . ,7(Cn), say. Then there is a C G {Ci : i G /} such that C C HiLi Ci. But then 
7(C) G {7(Ci) : i G /} and 7(C) C f]^^^ l{Ci) by the upwards monotonicity of 7 in p. Now 
we may apply Lemma and conclude that ^flig/ 7(^*1) = C\i^i^lCi). 

The case when ip is of the form 0^^7 is verbatim the same the previous case, except 
that we appeal to Lemma l4. 51 rather than Lemma l4. HI in the last step. 

Lastly, consider the case when 93 is of the form ^7. This follows by the inductive 
hypothesis and the fact that □ distributes over arbitrary intersections of subsets of VF. □ 

The next lemma is needed for the following reason: for the proof of canonicity of all 
formulae on which SQEMA succeeds we will need a version of Ackermann's lemma that is 
true of formulae in the extended language ML"*", when interpreted over descriptive frames. 
As already noted, the extension of an ML"'"-formula need in general not be an admissible 
set in such general frames. This creates and obvious impediment for the proof of one 
direction of the equivalence in the modal Ackermann's lemma, as the interpretations of 
propositional variables must be admissible sets. We can push it through, however, at the 
price of additional restrictions on formulae in terms of syntactic openness and closedness. 

Lemma 4.12 (Restricted Version of Ackermann's Lemma for Descriptive Frames). Let: 
* d = {W, R, W) be a descriptive or Kripke frame in ML; 
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• A{qi, qn,ii, ■■■,im) be a syntactically closed formula with PROP(^) C {(71, 
and NOM(A) C {ii, 

• B{qi, ...,qn,p,ii, ■■■,im) be a syntactically open formula with PROP(i?) C {g^, ...,qn,p} 
and NOM{B) C which is downwards monotone in p. 

Then for all Qi, . . . , Qn € W and xi, . . . , Xm € W: 

B{Qi, ...,Qn, A{Qi, ...,Qn, {Xl}, ... , {Xm}), {xi}, . . . , {Xm}) = W 

if and only if there is a P £ W such that 

A{Qi,...,Qn,{xi},...,{xm}) ^ P and B{Qi,...,Qn,P,{xi},...,{xm}) = W. 

Proof. For the sake of brevity we will omit the parameters Qi, . . . , Qn, {xi}, . . . , {xm} in 
what follows, and simply write A, B{P) etc. The implication from right to left follows 
by the downwards monotonicity of B in p. For the converse, if 3^ is a Kripke frame, i.e 
W = 2^, we can simply take P = A{Qi, . . . , Q„, {xi}, . . . , {xm}), since all subsets of W 
are admissible. 

Now, assume that J is any descriptive frame and B{A) = W . Let B'{p) be the negation 
of B[p) written in negation normal form. Then B'[p) is a syntactically closed formula, and 
B'{A) = 0. We need to find an admissible set P G W such that A ^ P and B'{P) = 0. 
Since yl is a syntactically closed formula, it follows by Lemma 14.91 that A is a closed subset 
of W and hence that A = {^{C G W : ^ C C}. Hence = B'{A) = B'{{^{C € W : 
^ C C}) = n{-S'(C) : C € W and ^ C C}, by Lemma KHX Again by Lemma |01 
{B'{C) : C € W, ^ C C} is a family of closed sets with empty intersection. Hence, by 
compactness, there must be a finite subfamily, Ci, . . . , say, such that fllLi -^'(C'j) = 0- 
But then C = fllLi ^« admissible set containing A, and B'{C) = 0, i.e B{C) = W . 

Hence we can choose P = C. □ 

We will refer to the equations which are pure formulae (i.e. do not contain propositional 
variables) as pure equations, and to the rest, as non-pure equations. 

Lemma 4.13. During the entire (successful or unsuccessful) execution of SQEMA on any 
input formula from ML, all formulae on the left-hand side of all non-pure SQEMA- equations 
are syntactically closed, and all formulae on the right-hand side of the non-pure equations 
are syntactically open. 

Proof. We follow any one branch of the execution, proceeding by induction. The starting 
system is of the form ||i — > ip , where ^|J € ML. For this system the conditions of the lemma 
hold, since i is syntactically closed and all ML-formulae are both syntactically closed and 
open. Now suppose that in the process of the execution we have reached a system satisfying 
the conditions of the lemma, i.e. all formulae on the left-hand side of all non-pure SQEMA- 
equations are syntactically closed, and all formulae on the right-hand side of the non-pure 
equations are syntactically open. It is straightforward to check that the application of any 
transformation rule to this system will preserve these conditions. In the particular case 
when the Ackermann-rule is applied, we note the following: (i) the pure equations in the 
system contain no propositional variables, and are hence disregarded in any application of 
the Ackermann-rule; (ii) by the inductive hypothesis the disjunction of left-hand sides which 
is substituted for the variable being eliminated are syntactically closed; (iii) substituting 
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a syntactically closed formula for positive (resp., negative) occurrences of a variable in a 
syntactically closed (resp., open) formula yields a syntactically closed (resp., open) formula. 

□ 

The content of the next lemma is essentially that the global satisfiability of systems 
of SQEMA-equations is invariant under all SQEMA transformation rules, subject to the 
constraint that i is evaluated to the current state. 

Lemma 4.14. Let Eq, . . . , Er be the sequence of systems of equations produced on one 
branch by SQEMA when executed on a certain input formula ip{qi, . . . ,qn) £ ML with 
PROP(99) = {qi, . . . ,qn}, and let i, be the nominals introduced during the exe- 

cution. Further, let ipi{qi, . . . ,qn,i,ii, ■ ■ ■ ,im) be the formula obtained by taking the con- 
junction of the equations in Ei for < i < r . Then for any descriptive or Kripke frame 
5" = {W, R, W) and current state w G W , there are Qi, . . . , Qn G W and xi, . . . , Xm G W 
such that 

ifiiQl, ■■■,Qn, {w}, {xi}, . . . , {xm}) = W 
if and only of there are Qi, . . . , Qn G W and xi, . . . , Xm € W such that 

ipi+l{Ql,. ..,Qn, {w}, {Xi}, ... , {Xm}) = W, 

for < i < r. 

Proof. Note that = i — > -193. For each i, the system i^i+i is obtained from the system 
Ei by the application of some transformation rule. Let 5^ be a descriptive or Kripke frame, 
and w a state in it. We need to verify that, whichever transformation rule was applied, 
(pi is globally satisfiable on ^ if and only if (/Jj+i is, subject to the constraint that i be 
always interpreted as w. This is immediate to see for all the transformation rules, except 
the Ackermann-rule, since they are based on simple propositional and (modal and hybrid) 
semantic equivalences. The only interesting case then, is the one for the Ackermann-rule. 
Suppose -Ej+i is obtained from Ei by application of this rule. Then ipi = Aj=i(o^j 
p) A AjLi(/5j) ^ /\"j=i 7i' where no Uj contains p, each f3j is negative in p, and no 7^ contains 
any occurrence of p. Note that all pure equations in the system Ei will be among the 7^. 
Now all the a^-'s are left-hand sides of non-pure SQEMA equations and hence, by lemma 

oa V 

aj is syntactically closed. Further, if we eliminate the implication sign from the 
/3j's, each becomes a disjunction of the negation of a left-hand side of a non-pure SQEMA- 
equation with the right-hand side of such an equation, which, again by lemma 14.131 is 
syntactically open. 

Then ^pi+i = /\Y=i{Pj) ^ A^=i7i> where each /3^- is obtained from (3j by substituting 
Vj=i Q^i for all occurrences of p. The proof is complete once we appeal to lemma 11. 121 □ 

Theorem 4.15. If SQEMA succeeds on a formula ip € ML, then is locally d-persistent 
and hence canonical, and moreover the first- order formula returned by SQEMA is a local 
equivalent of ip. 

Proof. Suppose that SQEMA succeeds on 93 G ML, and that pure{ip) is the pure formula 
obtained as the conjunction of the final system of SQEMA-equations in the execution. 
Further, for simplicity and without loss of generality, assume that the execution does not 
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branch because of disjunctions. We may make this assumption since a conjunction of d- 
persistent formulae is d-persistent, and the conjunction of local first-order correspondents 
of modal formulae is a local first-order correspondent for the conjunction of those formulae. 
For the canonicity, let 5 = {W, R,W) be a descriptive frame and w G W. Then: 

ly- if iff 

for some model A4 based on 5 with i denoting w, it is the case that Ih i — > -k/?. 

Note that ||i — s- -k/j is exactly the first system of SQEMA-equations obtained when 
the algorithm is run on if. Hence pure{ip) is obtained from i -k^ by the application of 
transformation rules. 

Hence, A4 \\- i ^ ^ip, for some model A4, based on ^ with i denoting w, iff 

(by lemma 14.141 for descriptive frames) for some model M, based on 5 with i denoting 
w, it is the case that A4 Ih pure((/9), iff 

for some model A4 based on the underlying Kripke frame F with i denoting lu, it is the 
case that A4 Ih pure((/9), since we allow nominals to range over all singletons both in Kripke 
and general frames. 

Now by the Kripke frames version of Lemma [4.14[ and the fact that pure{(p) is obtained 
from i —5- by the application of transformation rules, this is the case iff for some model 
M based on the underlying Kripke frame F with i denoting w, we have A4 \\- i ^ ^ip. 

This, in turn, will be so iff F, w 1/ if. This proves the local d-persistence of ip. 

Now, for the local first-order equivalence let ^ = {W, R) be a Kripke frame and w £ W. 
As in the formulation of lemma 14.141 let Eq, Er be the sequence of systems of equations 
produced by SQEMA when executed on if and let ipj be the formula obtained by taking the 
conjunction of the equations in Ej. We define the translation of a system Ej, TR{Ej), to be 
the second order formula 3P3lf^XQST(ipj , xq) , where P is the tuple of all predicate variables 
and y the tuple of all variables corresponding to nominals other than i, occurring in (pj. 
Note that yi, corresponding to i, is the only free variable in TR(Sj), and that TIl{Er) is 
3yVxoST(pure((y9),a;o). Then: 
h ip iff 

^?hVP5T(<^,xo)[xo ■.= w] iff 

d \= VP3xo5T(i A ip,xo)[yi := w] iff 

5" ^ 3PVxo5'r(i ^(p,xo)[yi := w], i.e. iff 

d^TR{Ei)[y, ■.= w]. 

Now, phrased in second-order logic, lemma l4. 141 savs that 
^ ^ TR{Ej)[yi := w] if and only if 5^ |= TR{Ej+i)[yi := w], for all 1 < j < r. 

Hence we get that ^^w h ip iS ^ Y= 3yVxoST(pure(93), rEo)[yi := w], 
i.e that h ip iS ^ \= Vy3xo-'ST(pure((/?), xo)[yj := w]. 

Hence Vy3xo~'ST(pure(93), xo) is a local first-order correspondent for ip, and exactly 
what SQEMA returns. Accordingly, VyiVy3a;o-'ST(pure((^), xq) is a global first-order corre- 
spondent of □ 

5. Completeness results 

In this section we show that SQEMA succeeds in computing the local first-order frame 
equivalent of every Sahlqvist formulae, and then we extend that to every monadic inductive 
formula. While the latter result subsumes the former, we nevertheless present both proofs 
in order to take the reader first through the simpler case. 
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5.1. Completeness of SQEMA on Sahlqvist Formulae. Let us briefly recall what 
Sahlqvist formulae are (see |21 113 ) : 

• a boxed atom is a prepositional variable, prefixed with finitely many (possibly none) 
boxes. 

• a Sahlqvist antecedent is a formula built up from T, ±, boxed atoms, and negative 
formulae, using A, V and diamonds. 

• a Sahlqvist implication is a formula of the form ip Pes where (p is a Sahlqvist 
antecedent and Pos is a positive formula. In particular, note that any negative 
formula is a Sahlqvist antecedent. 

• a Sahlqvist formula is built up from Sahlqvist implications by applying conjunctions, 
disjunctions, and boxes. ^ 

The following lemma is useful: 

Lemma 5.1. Let ip be a Sahlqvist formula, and ip' the formula obtained from ^p by im- 
porting the negation over all connectives. Then pJ is a Sahlqvist antecedent. 

Proof. Induction on the construction of ip from Sahlqvist implications. If (/? is a Sahlqvist 
implication a — > Pos, negating and rewriting it as a A-^Pos already turns it into a Sahlqvist 
antecedent. 

ip = Oip^ where ip satisfies the claim, then -199 = O^ip hence the claim follows for (p, 
because Sahlqvist antecedents are closed under diamonds. 

Likewise, if 99 = ipi A 1^2, where Vi and 1^2 satisfy the claim, then -k/? = -i-i/'i V ^ip2 
hence the claim follows for ip, because Sahlqvist antecedents are closed under disjunctions. 

The case of p = ipi y ijj2 is completely analogous. □ 

Corollary 5.2. Every Sahlqvist formula is semantically equivalent to a negated Sahlqvist 
antecedent, and hence to a Sahlqvist implication. 

Sahlqvist's theorem states that all Sahlqvist formulae are elementary and canonical. 
For a thorough discussion on Sahlqvist formulae and proof of Sahlqvist's theorem, see e.g. 

Lemma 5.3. Let E be a system of SQEMA equations of the form j ^ /5, with j a nominal 
and P a Sahlqvist antecedent built up without using V, except possibly inside negative formu- 
lae. Let p be any propositional variable occurring both negatively and positively in E. Then 
E can be transformed, using only the A-rule, O-rule, O-rule and Ackermann-rule, into a 
system E' , again with all equations of the form ] — > P, which does not contain p. 

Proof. All positive occurrences of p are in boxed atoms, occurring at most in the scope of 
conjunctions and diamonds. We first have to separate p, i.e. transform E into a system 
in which the all equations in which p occurs positively are of the form 7 ^ p, with p not 
occurring in 7. Start by applying the O-rule and A-rule to separate the boxed atoms of 
p. The system then obtained is still one in which all equations are of the form j — > /3, 
with j a nominal and (3 a Sahlqvist antecedent — recall that pure formulae are regarded 
as both positive and negative. All equations in which p occurs positively are now of the 
form j — > □"p. By applying the D-rule these are transformed into equations of the form 
(0~^)"j p. Note that the system is now no longer in the desired form of implications 

"^The definition in 2 requires that disjunctions are only appfied to formulae not sharing variables. Ap- 
parently, this requirement is unnecessary. 
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from nominals to Sahlqvist antecedents. This is remedied by applying the Ackermann- 
rule to ehminate p, together with ah equations of the form (0^^)"'j —>■ p. Note that pure 
formulae are substituted for negative occurrences of p, hence the substitution leaves negative 
formulae negative, and hence Sahlqvist antecedents as Sahlqvist antecedents. The system 
now obtained is once again of the desired form, and p has been eliminated. □ 

Theorem 5.4. SQEMA succeeds on every Sahlqvist formula. 

Proof. Let iphea Sahlqvist formula. Let us see what SQEMA does with it. In step one, -199 is 
formed and the negation imported over all connectives. Call the formula so obtained if' . By 
Lemma l5. II y?^ is a Sahlqvist antecedent, ip' is now transformed into the form Vj=i '^j with 
each Oj a Sahlqvist antecedent built up from T, _L, boxed atoms and negative formulae 
without using disjunctions by distributing conjunctions and diamonds over disjunctions. 
Note that it is possible to "extract" all disjunctions, except possibly some within negative 
formulae, since in ip' none of these disjunctions occur within the scope of boxes. 

From here the algorithm proceeds on each disjunct separately; we will follow one of 
them, Oj. Let pi, . . . ,Pm be an arbitrary ordering of the variables occurring in aj. The 
initial system of equations is 

II i^aj- 

Note that this is a system of the form required by Lemma 15.31 If pi occurs only 
positively (negatively) SQEMA eliminates it by substituting it with T (_L). If pi occurs both 
negatively and positively, it follows from Lemma [5.3l that SQEMA will eliminate it. In both 
cases the resulting system is again one of the form required by Lemma 15.31 Propositional 
variables P2, ■ ■ ■ ,Pm remain to be eliminated. Proceeding inductively, noting that after each 
elimination Lemma 15.31 remains applicable, it follows that SQEMA will succeed in doing 
this. Negating and translating the resulting pure formula, SQEMA obtains the first-order 
(local) frame equivalent for aj. Taking the conjunction of these equivalents for 1 < j < n 
yields the first-order (local) frame equivalent for the Sahlqvist formula (f given as input. □ 

5.2. Completeness of SQEMA on Monadic Inductive Formulae. 

Definition 5.5. Let [| be a symbol not belonging to ML^. Then a box-form of ^ in ML^ is 
defined recursively as follows: 

(1) jj is a box-form of jj; 

(2) If B(jJ) is a box-form of [1 and □ is a box-modality then □B(tj) is a box-form of jj. 

(3) If B(tt) is a box- form of jj and A is a positive formula then A ^ B(tj) is a box- form 
of «. 

Thus, box-forms of ^ are, up to semantic equivalence, of the type 

ylo ^ 01(^1 ^...□„(A„^ #)...) 

where Di, . . . , □„ are compositions of boxes in ML~^ and Ai, . . . , An are positive for- 
mulae (possibly, just T). 

Definition 5.6. By substituting a propositional variable p for fj in a box-form B(tt) we 
obtain a box-formula of p, namely B(p). The last occurrence of the variable p is the head 
of B(p) and every other occurrence of a variable in B(p) is inessential there. 

Definition 5.7 f jl7l Hj). A monadic regular formula is a modal formula built up from T, 
_L, positive formulae and negated box-formulae by applying conjunctions, disjunctions, and 
boxes. 
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Definition 5.8. The dependency digraph of a set of box-formulae B = {Bi(pi), . . . , B„(|?„)} 
is a digraph = {V, E) where V = {pi, . . . is the set of heads in and edge set 
such that PiEpj iff pi occurs as an inessential variable in a box from B with a head pj. A 
digraph is acyclic if it does not contain oriented cycles (including loops). 

The dependency digraph of a formula is the dependency digraph of the set of box- 
formulae that occur as subformulae of that formula. 

Definition 5.9 ( jlTl Hj). A monadic inductive formula (MIF) is a monadic regular formula 
with an acyclic dependency digraph. 

Example 5.10. An example of a monadic inductive formula, which is not a Sahlqvist 
formula (see ^lEl); is: 

D =pA a{Op Dq) Oanq = ^py ^□(Op ^ Uq) V ODDg, 

obtained as a disjunction of the negated box-formulae ^p and -•□(Op — > Dg), and the 
positive formula ODDg. The dependency digraph of D over the set of heads {p, q} has only 
one edge, from p to q. 

An example of a regular, but non-inductive formula is □((^□p V (7) V {-^q V Dp)) V -ip, 
because the heads p and q depend on each other. 

The abbreviation NegMIF will be used for the negation of a monadic inductive formula 
in negation normal form. Note that the class of NegMIFs in a language are precisely those 
formulae built from T, _L, negative formulae and box- formulae using conjunction, disjunc- 
tion and diamonds, which have acyclic dependency digraphs. The abbreviation NegMIF* 
will be used for NegMIFs built up without the use of disjunction, i.e. the class of all formulae 
of the language built from T, _L, negative formulae and box-formulae using conjunctions 
and diamonds, which have acyclic dependency digraphs. 

We will call a system of SQEMA equations a HegH\¥ -system {HegHW* -system) if it 
has the form 

ii —> ai 

; ; 

where each ij is a nominal, and the formula ai A . . . A a^, obtained by taking the 
conjunction of all consequents of equations in the system, is a NegMIF (NegMIF*) in ML~^. 

Lemma 5.11. Let E be a NegMIF* -system of SQEMA- equations, and p any propositional 
variable occurring in E. Then p can be eliminated from E, either by substitution with T or 
_L, or by application of the SQEMA transformation rules. Moreover, the system of equations 
obtained after the elimination of p will again be a NegMIF* -system. 

Proof. Suppose p occurs only positively (negatively) in the system, then we substitute T 
(_L) for all its occurrences, eliminating it, and yielding a NegMIF*-system. 

So suppose p occurs both positively and negatively. We will separate out all positive 
occurrences to prepare for the application of the Acker mann-rule. The only positive occur- 
rences of p are heads of (possibly trivial) box-formulae in the consequents of the equations 
of the system. Let ij — > a be such an equation, where a contains positive occurrences of 
p. Exhaustive application of the A-rule and the O-rule splits ij — s- a into equations of the 
forms j Ok, 1 Box and m Neg, with j, k, 1, m nominals. Box a box-formula and Neg 
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a negative formula. As all box-formulae have been left intact, the dependency digraph is 
unchanged. Hence, after the application of these rules we still have a NegMIF*-system. 

Now it remains to separate the positive occurrences of p out of the equations of the 
form 1 Box, where Box is of the form V □2(. . . □n(^n V p) . . .)) where each Ai 

is a negative formula and each Dj a finite, possibly empty, sequence of box modalities. 
Successive alternative applications of the Left shift V-rule and Left shift D-rule transforms 
the equation 

1 ^ V □1(^1 V . . . □„(A„ V p) . . .)) 

into 

^AnAO-\...0^\\A^Ao)...)^p. 

The antecedent in the above equation is a positive formula, not containing p, because 
the dependency graph is loopless. Hence all positive occurrences of p in the system now 
occur as the consequents in equations of the form Pos — > p, where Pes is a positive formula 
not containing p. Let p be the disjunction of all the antecedents of the equations of the form 
Pos — > p. Then, by applying the Ackermann-rule, all equations of this form are deleted and 
p is eliminated by substitution of the positive formula p for every negative occurrence of p. 
Thus, the resulting system does not contain p, all antecedents of equations are nominals, 
and all consequents of equations are built up from negative formulae and box-formulae, by 
using only conjunctions and diamonds. 

To show that the resulting system is again NegMIF*, it remains to show that the de- 
pendency digraph is acyclic. We will do so by showing that whenever a new arc {q, u) was 
introduced by the application of the Ackermann-rule, there was already a directed path from 
vertex q to vertex u in the digraph before the substitution. Indeed, for the only way {q, u) 
could have been introduced was by the substitution of p for an inessential occurrence of p in 
some box-formula with u as head. But then q must occur in p, hence, by the construction of 
p, it must have occurred inessentially in some box-formula headed by p. But then {q,p) and 
{p, u) were arcs in the dependency digraph before the application of the Ackermann-rule, 
giving the desired path. Thus, the application of the Ackermann-rule cannot introduce 
cycles in a previously acyclic dependency graph, which completes the argument. □ 

Theorem 5.12. SQEMA succeeds on all conjunctions of monadic inductive formulae. 

Proof. An easy inductive argument, noting that the initial SQEMA-systems for every con- 
junction of monadic inductive formulae are NegMIF*-systems, and using Lemma 15.111 □ 

Remark 5.13. It has been proved in ^S] (see also that every Sahlqvist formula is 
tautologically equivalent to a conjunction of inductive formulae, which now provides another 
proof of completeness of SQEMA for Sahlqvist formulae. 

Corollary 5.14 (Sahlqvist theorem for inductive formulae, see JZ]). All monadic inductive 
formulae are elementary and canonical. 

Conjecture 5.15. All modal formulae on which SQEMA succeeds are locally equivalent 
to inductive formulae. 

6. Concluding remarks and further work 

The algorithm SQEMA is presented here in its core version, which is apparently already 
quite powerful. In particular, we are currently not aware of examples whereby SQEMA fails 
on a modal formula on which the current implementations of either SCAN or DLS succeed. 
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Furthermore, SQEMA is amenable to various extensions, some of which are being de- 
veloped in forthcoming sequels |S1 IHl IZ| to this paper. In particular: 

Monotonicity-based version: While the condition on the formula B{P) in the for- 
mulation of Ackermann's lemma is stated in syntactic terms, the proof of that lemma 
uses a semantic property of viz upwards, resp. downwards monotonicity with 

respect to P. Thus, the lemma can be strengthened accordingly to a strong, or se- 
mantic Ackermann's lemma, which accordingly yields a stronger, semantic version 
of the respective Ackermann rule used by SQEMA. Moreover, testing a modal for- 
mula for monotonicity in a propositional variable occurring in it is decidable, which 
enables the algorithmic use of such rule in a stronger version of SQEMA, albeit at 
the price of possibly higher complexity. 

Polyadic languages: The algorithm can be accordingly extended to work in arbi- 
trary polyadic modal languages (presented as purely modal languages in |15| I17j ) 
and to succeed on all polyadic inductive formulae (ibid.). 

Complex formulae: Furthermore, it can be extended with suitable rules to succeed 
on the so called complex formulae introduced in These can be converted to 

inductive formulae by using rather non-trivial substitutions. 

Least fixed points: As shown in ^7], all regular formulae (in arbitrary polyadic 
languages) have equivalents in first-order logic extended with least fixed points 
FO-I-LFP. These equivalents can be computed by a suitable extension of SQEMA 
with a recursive version of the Ackermann-rule. For instance, that extension suc- 
ceeds on the Godel-Lob formula □(□p — > p) — > Op and on Segerberg's induction 
axiom IND = [2]{q [l]g) ^ (g — > [2]<?). Note that these formulae are not 
canonical anymore. 

Theory reasoning: The algorithm SQEMA is amenable to further extensions with 
rules capturing theory reasoning, which enables computing first-order correspon- 
dents relativized over classes of frames, and thus succeeding on notorious cases of 
elementary canonical formulae, such as modal reduction principles over the class of 
transitive frames. 
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